Backstitch: Counteracting Finite-Sample Bias via Negative Steps
Abstract
In this paper we describe a modification to Stochastic Gradient Descent (SGD) that improves generalization to unseen data. It consists of doing two steps for each minibatch: a backward step with a small negative learning rate, followed by a forward step with a larger learning rate. The idea was initially inspired by ideas from adversarial training, but we show that it can be viewed as a crude way of canceling out certain systematic biases that come from training on finite data sets. The method gives ~10% relative improvement over our best acoustic models based on lattice-free MMI, across multiple datasets with 100-300 hours of data.
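The two-step update described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `backstitch_step`, the parameters `lr` and `alpha`, the forward rate of `(1 + alpha) * lr`, and the choice to recompute the gradient on the same minibatch after the backward step are all assumptions made here for the sake of a concrete example. The toy least-squares gradient is likewise hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]
Batch = Sequence[Tuple[float, float]]


def backstitch_step(
    params: Vector,
    grad_fn: Callable[[Vector, Batch], Vector],
    batch: Batch,
    lr: float,
    alpha: float,
) -> Vector:
    """One two-step update on a single minibatch (illustrative sketch).

    Assumption: the backward step uses learning rate -alpha * lr and the
    forward step uses (1 + alpha) * lr, with the gradient recomputed on the
    same minibatch in between.
    """
    # Backward step: a negative learning rate moves *along* the gradient.
    g_back = grad_fn(params, batch)
    stepped_back = [p + alpha * lr * g for p, g in zip(params, g_back)]

    # Forward step: larger positive learning rate, gradient taken at the
    # parameters produced by the backward step.
    g_fwd = grad_fn(stepped_back, batch)
    return [p - (1.0 + alpha) * lr * g for p, g in zip(stepped_back, g_fwd)]


def lsq_grad(params: Vector, batch: Batch) -> Vector:
    """Gradient of 0.5 * mean((w * x - y)^2) for a one-parameter model."""
    w = params[0]
    return [sum((w * x - y) * x for x, y in batch) / len(batch)]


if __name__ == "__main__":
    toy_batch = [(1.0, 2.0), (2.0, 4.0)]  # consistent with y = 2 * x
    w = [0.0]
    for _ in range(50):
        w = backstitch_step(w, lsq_grad, toy_batch, lr=0.1, alpha=0.3)
    print(w)
```

With `alpha = 0` the backward step vanishes and the function reduces to an ordinary SGD update, which makes it easy to compare the two on the same minibatch.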
Similar resources
Bias properties of Bayesian statistics in finite mixture of negative binomial regression models in crash data analysis.
Factors that cause heterogeneity in crash data are often unknown to researchers and failure to accommodate such heterogeneity in statistical models can undermine the validity of empirical results. A recently proposed finite mixture for the negative binomial regression model has shown a potential advantage in addressing the unobserved heterogeneity as well as providing useful information about f...
Interpreting Ambiguous Social Situations in Social Anxiety: Application of Computerized Task Measuring Interpretation Bias
Background and Aims: Interpretation bias, which is an important factor in the pathology of social anxiety disorder, has recently been considered in therapeutic approaches. Given the importance of interpretation bias in the treatment of social anxiety, and despite the ambiguity in the relationship between social anxiety and interpretation bias, we compared the interpretation bias in individua...
Reducing Prejudice Through Brain Stimulation.
BACKGROUND Social categorization and group identification are essential ingredients for maintaining a positive self-image that often lead to negative, implicit stereotypes toward members of an out-group. The medial prefrontal cortex (mPFC) may be a critical component in counteracting stereotypes activation. OBJECTIVE Here, we assessed the causal role of the mPFC in these processes by non-inva...
Multi-step learning and underlying structure in statistical models
In multi-step learning, where a final learning task is accomplished via a sequence of intermediate learning tasks, the intuition is that successive steps or levels transform the initial data into representations more and more "suited" to the final learning task. A related principle arises in transfer-learning where Baxter (2000) proposed a theoretical framework to study how learning multiple ta...
Comparison of non-attendance intervention methods of standard cognitive bias modification, self-generation-based cognitive bias modification, and cognitive-behavioral training in depressed students
Objectives: A recent method for modifying intrusive memories is cognitive bias modification. This study aimed to investigate and compare three non-attendance therapies: standardized cognitive bias modification, self-generation-based cognitive bias modification, and cognitive-behavioral training. Methods: According to inclusion and exclusion criteria, a total of 51 participants were selec...
Publication date: 2017